Chinese Main Verb Identification: From Specification to Realization

Authors

  • Binggong Ding
  • Changning Huang
  • Degen Huang
Abstract

Main verb identification is the task of automatically identifying the predicate verb of a sentence. It is useful for many applications in Chinese Natural Language Processing. Although most studies have focused on the model used to identify the main verb, the definition of the main verb should not be overlooked. While designing our specification, we found many complicated issues that remain unresolved because they have not been discussed thoroughly in previous work. The first novel aspect of our work is therefore a carefully designed specification for annotating the main verb, together with an investigation of various complicated cases. We hope this discussion will help to uncover the difficulties involved in this problem. Second, we present an approach to main verb identification based on chunk information, which leads to better results than an approach based on part-of-speech information. Finally, based on careful observation of the studied corpus, we propose new local and contextual features for main verb identification. We annotate a corpus according to our specification and then use a Support Vector Machine (SVM) to integrate all the features we propose. Our model, trained on the annotated corpus, achieved a promising F-score of 92.8%. Furthermore, we show that main verb identification can improve the performance of the Chinese Sentence Breaker, one of its applications, by 2.4%.
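The abstract does not say how the reported F-score is computed. As a minimal sketch, assuming the standard balanced F-measure (the harmonic mean of precision and recall) and assuming one main-verb token index per sentence, the evaluation could look like the following. The function name `f_score` and the use of `None` for "no main verb" are illustrative choices, not taken from the paper.

```python
def f_score(gold, predicted):
    """Balanced F-measure for main verb identification.

    gold, predicted: equal-length lists holding, for each sentence, the token
    index of the main verb, or None when no main verb is annotated/predicted.
    (Assumed data layout; the paper's own evaluation format is not given.)
    """
    # A prediction is correct only when it names the same token as the gold label.
    correct = sum(1 for g, p in zip(gold, predicted) if g is not None and g == p)
    n_pred = sum(1 for p in predicted if p is not None)  # verbs the system proposed
    n_gold = sum(1 for g in gold if g is not None)       # verbs in the annotation

    precision = correct / n_pred if n_pred else 0.0
    recall = correct / n_gold if n_gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Four sentences: one correct, one wrong index, one spurious, one missed.
example = f_score([2, 0, None, 5], [2, 1, 3, None])
```

Under this convention, a wrong index counts against both precision and recall, so a system cannot raise its score by always proposing some verb.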


Related resources

“Those Nation Wreckers are Suffering from Inferiority Complex”: The Depiction of Chinese Miners in the Ghanaian Press

This article studies the depiction of Chinese miners in the Ghanaian news website entitled Modern Ghana. A total of 87 articles comprising 43752 words were retrieved. Van Leeuwen’s (2008) theory of the representation of the social actors was utilised to examine the depiction of Chinese miners in the Ghanaian press. In this regard, six applicable tools were used and these include exclusion, role...


Lexicalization Typology of Realization Events in Mandarin Chinese

There has been a heated debate on the typological status of Mandarin Chinese in the Talmyan framework of Verb-framed languages (V-languages) and Satellite-framed languages (S-languages). However, most previous studies focus on motion events, while other macro-events (Talmy, 2000) receive little attention. The present study aims to investigate event of realization in Mandarin Chinese with experimental m...


Chinese Resultative Verb Compounds: Lexicalization and Grammaticalization

This paper is an historical study of the formation of the Chinese resultative verb compounds (RVCs) that signal a resultant state of a non-agent with a V1V2 predicate. Metaphorization and metonymization, understood within the theoretical framework of Brinton & Traugott (2005), are proposed to have played a most important role in the formation of the RVC in Middle Chinese. Many scholars noted (W...


A Constructional Approach to Argument Realization of Chinese Resultatives

This paper argues for a constructional view of the resultative constructions, specifically the resultative-verb compounds (henceforth RVCs). We claim that the effect of the construction must be taken into account in the realization of arguments, and the realization must be moderated by the linking rules. This paper is organized as follows: Section 2 provides a brief definition of resultatives i...


Motion events in Chinese novels: Evidence for an equipollently-framed language

Motion events typically involve an entity moving along a path in a certain manner. Research on language typology has identified three types of languages based on the characteristic expression of manner and path information. In satellite-framed languages, the main verb expresses information about manner of movement and a subordinate satellite element (e.g., a verb particle) to the verb conveys t...



Journal title:
  • IJCLCLP

Volume 10  Issue 

Pages  -

Publication date 2005